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Abstract 

Background: Persistent infection witli oncogenic Human Papillomavirus (HPV) is associated with the development 
of cervical cancer with each genotype differing in their relative contribution to the prevalence of cervical disease. 
HPV DNA testing offers improved sensitivity over cytology testing alone but is accompanied by a generally low 
specificity. Potential molecular markers of cervical disease include type-specific viral load (VL), integration of HPV DNA 
into the host genome and methylation of the HPV genome. The aim of this study was to evaluate the relationship 
between HPV type-specific viral load, integration and methylation status and cervical disease stage in samples 
harboring HPV16, HPV18, HPV31 or HPV45. 

Methods: Samples singly infected with HPV16 (n = 226), HPV18 (n = 32), HPV31 (n = 75) or HPV45 (n = 29) were 
selected from a cohort of 4,719 women attending cervical screening in England. Viral load and integration status 
were determined by real-time PGR while 3'Ll-URR methylation status was determined by pyrosequencing or 
sequencing of multiple clones derived from each sample. 

Results: Viral load could differentiate between normal and abnormal cytology with a sensitivity of 75% and a 
specificity of 80% (odds ratio [OR] 12.4, 95% Gl 6.2-26.1; p < 0.001) with some variation between genotypes. Viral 
integration was poorly associated with cervical disease. Few samples had fully integrated genomes and these 
could be found throughout the course of disease. Overall, integration status could distinguish between normal 
and abnormal cytology with a sensitivity of 72% and a specificity of 50% (OR 2.6, 95% Gl 1 .0-6.8; p = 0.054). 
Methylation levels were able to differentiate normal and low grade cytology from high grade cytology with a 
sensitivity of 64% and a specificity of 82% (OR 8.2, 95% Gl 3.8-1 8.0; p < 0.001). However, methylation varied widely 
between genotypes with HPV18 and HPV45 exhibiting a broader degree and higher magnitude of methylated 
GpG sites than HPV16and HPV31. 

Conclusions: This study lends support for HPV viral load and GpG methylation status, but not integration status, 
to be considered as potential biomarkers of cervical disease. 
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Background 

Persistent infection with oncogenic genotypes of genital 
Human Papillomavirus (HPV) is associated with the devel- 
opment of cervical cancer, a significant cause of morbidity 
and mortality of women worldwide [1]. Precancerous 
cervical disease is classified by cytological (low [LSIL] 
or high grade [HSIL] squamous intraepithelial lesions) 
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and histological stages (cervical intraepithelial neoplasia 
[CIN] grades 1 to 3). There are about a dozen HPV types 
associated with the development of cervical cancer [2], 
differing in their relative contributions to the prevalence 
of cervical disease [3]. Testing for the presence of onco- 
genic HPV DNA offers improved sensitivity, though 
lower specificity, than cytology alone [4] while the next 
generation of molecular tests, including those with lim- 
ited genotyping capability, may improve upon this [5]. 
Other potential molecular markers of cervical disease 
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include type-specific viral load (VL), integration of HPV 
DNA into the host genome and methylation of the HPV 
genome. An improved understanding of the role of 
these potential molecular markers in cervical disease 
development may shed some light on HPV pathogenesis 
and may be helpful to guide future cervical cancer 
screening or treatment algorithms. 

HPV DNA VL, usually estimated as the amount of HPV 
genome copies per cell, has been variably associated with 
cervical disease. Some studies were able to use HP VI 6 VL 
to differentiate between high grade (HSIL, CIN2+) and 
low grade (LSIL, CINl) disease [6-8], between cervical 
cancer and lower grades of disease [9-11], or between any 
grade of cervical disease and normal samples [12]. Other 
studies could not find any association between HP VI 6 
VL and cervical disease [13,14]. There are few studies 
examining any potential link between VL and cervical 
disease for other HPV types with some finding a posi- 
tive association between some stages of disease for 
HPV18 [8,9], HPV31 [8], HPV33 [8], and HPV52 [9] 
while others have not [9,14-17]. 

HP VI 6 integration status has been able to distinguish 
between HSIL and LSIL samples [6], between cervical 
cancer samples and those of a lower grade of disease 
[9,10], or in some studies a strong positive correlation with 
increasing disease severity has been found [18]. In other 
studies no relationship between cervical disease grade and 
HP VI 6 integration status was apparent [11-13,19]. For 
HP VI 8 there are fewer studies overall with some finding 
an association with disease [9,18] and some not [16]. One 
study [18], found a strong positive correlation with in- 
creasing disease severity for HPV31, HPV33 and HPV45 
while not for HPV52 [9,15] and HPV58 [9,17] in others. 

Methylation of CpG sites within LI [20-22], the up- 
stream regulatory region (URR) [20,21,23] and/or other 
regions of the HP VI 6 genome [24-26] have often, but not 
always [27,28], been associated with cervical disease. Data 
on the degree of CpG site methylation for genotypes 
HPV18 [25,29,30], HPV31 [29] and HPV45 [29] are lim- 
ited but appear to show a similar trend, suggesting that 
HPV methylation may be useful as a potential marker 
for cervical disease [31]. 

Some studies have examined both VL and integration 
status for HPV16 [6,7,9-13] but for other types including 
HPV18 [9,16], HPV52 [9,15] and HPV58 [9,17] the 
sources are limited. The VL of samples containing fully 
integrated HPV tends to be lower than that found in 
samples containing purely episomal or mixed forms 
[6,7,10,15], although this does not always appear to be 
the case [16,17]. Fewer studies have examined methyla- 
tion status in relation to other parameters and then only 
for HPV16 infection [22,32]. 

Mixed infections are common throughout the course 
of cervical disease [3]. Few of these studies have 



explicitly used, or separately analyzed, samples harboring 
a single HPV type wherein the association between the 
HPV type under evaluation and cervical disease can 
be made with some confidence. Within these limited 
number of studies, the VL of samples harboring single 
infections has been associated with disease severity 
in some [7] but not in other studies [13,14,17]. For 
integration-based studies that explicitly mentioned the 
use of single infection samples, one study found an 
association between HPV16 integration status and 
disease [7] while another did not [13]. The only study 
explicitly to examine methylation levels in single infec- 
tions (HPV18 and HPV31) demonstrated that some 
CpG sites exhibited higher methylation levels in CIN3 
cases harboring single infections than in CIN3 cases 
with multiple infections with these types [29]. 

In this study, we evaluate the DNA viral load, integra- 
tion and CpG methylation status of women singly in- 
fected with HPV16, HPV18, HPV31 or HPV45 in order 
better to understand the potential role for these molecu- 
lar markers in cervical disease. 

Methods 

Samples 

The present study made use of DNA (archived at -25°C 
for 2-3 years) from individuals singly infected with 
HPV16 (n = 226), HPV18 (n = 32), HPV31 (n = 75) or 
HPV45 (n = 29) from a cohort of 4,719 women attending 
cervical screening in England [33]. The age distributions 
within these monospecific infection groups were similar 
for HPV16 (median 39 [inter-quartile range, IQR, 31 - 48] 
years), HPV18 (42 [32 - 51]), HPV31 (41 [33 - 51]) and 
HPV45 (40 [26 - 43]) (j^ = 0.222 Kruskal-Wallis test). 
Accompanying histological data were available for ca. 15% 
of the cytology samples in the total study cohort. Amongst 
these, CIN2+ was diagnosed in 18% of borderline or mild 
dyskaryosis and in 79% of moderate or severe dyskaryosis. 
For analytical purposes, cytology grades of borderline 
or mild dyskaryosis were categorized as low grade (LG) 
cytology and cytology grades of moderate or severe 
dyskaryosis as high grade (HG) cytology. The testing of 
residual, anonymized DNA extracts for the purposes 
of improved understanding of cervical disease was 
approved by the Harrow Research Ethics Committee, 
UK (08/H0719/17). 

Cell lines C33A (HTB-31, HPV negative), CaSki (CRL- 
1550; high copy HPV16), SiHa (HTB-35; low copy HPV16) 
and HeLa (CCL-2; HPV18) were from the American 
Type Culture Collection (LGC Standards, UK). Full 
genome plasmids were kindly provided by the German 
Cancer Research Centre (E.M. de Villiers: HPV16, HPV18, 
HPV45) and Qiagen Gaithersburg Inc., USA (A. Lorincz; 
HPV31). Human glyceraldehyde 3-phosphate dehydro- 
genase (GAPDH) and HPV E6 plasmids representing 
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each type were made by insertion of PGR product into 
pCR2.1-TOPO (Invitrogen). The indicated reference 
sequences for HPV16 (K02718), HPV18 (X05015), HPV31 
(J04353) and HPV45 (X74479) were used (http://pave. 
niaid.nih.gov). 

DNA viral load 

PGR primers and probes targeting HPV E6 and GAPDH 
(Additional file 1: Table SI) were optimized for the ABI 
7500 Fast PGR machine (Applied Biosystems) using Plat- 
inum UDG Supermix (Life Technologies). Full genome 
plasmid standards from 10^ - 10^ copies per reaction 
yielded median amplification efficiencies, linearity (r^) 
and the GV% of inter-assay Gt values of 96% (inter- 
quartile range, IQR, 92 - 98%), 0.998 (0.996 - 0.998), 
3.1% (2.0 - 5.2%), respectively. VL is presented as HPV 
copies per cell (c/c) as determined by the viral copies 
per reaction/ (GAPDH/2) copies per reaction. A positive 
control of pooled HPV16, HPV18, HPV31 and HPV45 
DNA from a mixture of samples demonstrated good 
reproducibility: HPV16 median VL 0.39 c/c (IQR 0.32 - 
0.61; n = 24), HPV18 1.66 c/c (1.50 - 1.88; n = 6); 
HPV31 0.22 c/c (0.14 - 0.25; n = 9) and HPV45 0.22 c/c 
(0.20 - 0.26; n = 9). 

Viral integration 

The integration status of HPV16 [13], HPV18 [16], 
HPV31 and HPV45 was assessed using a ratio of the E2 
gene copies over the E6 gene copies per reaction. Two 
estimates were made using an amino-terminal (Nt) and a 
carboxy-terminal (Gt) E2 PGR (Additional file 1: Table SI). 
PGR amplification was carried out on the ABI 7500 
Fast PGR platform using Platinum SYBR green qPGR 
SuperMix-UDG (Life Technologies). Discriminatory power 
was determined using genome (representing episomal 
DNA) and E6 only (representing integrated DNA) plasmids 
in a background of G33A cells to simulate integration 
proportions of 0, 20, 50, 80, and 100% for each type. 
Linearity of these calibrators was good (median 0.998 
[IQR 0.994 - 0.998]). The lower limit of the 99% confi- 
dence interval (GI) for E2/E6 ratios obtained using the 
full length plasmid (0% integration) could be differen- 
tiated from the upper 99% GI of the E2/E6 ratios for 
samples containing a simulated integration level of 20% 
(j9 < 0.01). This empirical threshold was used to differenti- 
ate between samples bearing fully episomal or mixed 
forms of the virus [7,11]. If the E2/E6 ratio for both Nt 
and Gt E2 fragments was above the respective empirical 
threshold, the sample was considered as bearing only 
episomal forms of the virus. If one or both fragments 
were not amplified, the sample was considered to bear 
fully integrated virus, otherwise the sample was desig- 
nated as containing mixed forms of the virus. 



CpG methylation 

The degree of GpG site-specific methylation within the 
3'Ll-URR regions of the HPV genome was estimated 
using methylation-specific PGR (Additional file 1: Table SI) 
of bisulfite-treated DNA (EZ DNA Methylation-Gold 
kit; Zymo Research) followed by pyrosequencing (HP VI 6) 
[34] or direct sequencing (HPV18, HPV31, HPV45) of five 
pGR2.1-TOPO clones per sample. The breadth and mag- 
nitude of GaSki, SiHa and HeLa cell GpG methylation 
were as expected [30,34]. GpG site methylation within the 
3'Ll-URR region of GaSki (n = 9 pGR2.1-TOPO clones) 
and SiHa (n = 10 clones) cells was similar to that obtained 
by pyrosequencing (Wilcoxon paired signed rank test, 
j9 = 0.400). Methylation levels of extracted SiHa DNA 
(n = 4) stored at -25°G for ca, 3 years were essentially 
the same as freshly cultured and extracted SiHa cells 
(n = 17; Pearson s r =0.996; p < 0.001). 

Statistical analysis 

The Mann Whitney U test and the test for trend were 
used to evaluate differences between two groups and 
three independent groups, respectively. Kruskal-Wallis 
was used to test for differences between multiple groups. 
The Wilcoxon paired sign rank test was used to test for 
differences between two groups of paired data. 

Receiver operating characteristic (ROG) analyses were 
used to evaluate whether a parameter could differentiate 
between cases (HG with or without LG cytology sam- 
ples) and controls (normal with or without LG cytology 
samples). The area under the curve (AUG) is a measure 
of how well a parameter could differentiate between 
these two groups (cases and controls), where no differ- 
entiation yields a value of 0.500 (equality) and scores of 
0.800 or above are considered strong. The crossing point 
for the ROG curve was derived using the maximum 
Youden index (J) yielding the optimum balance of sensi- 
tivity and specificity and the resulting threshold of the 
parameter under study. The Fisher s exact test was used 
to test for differences in proportions between cases and 
controls with crude odds ratios (95% GI) also given. 

Tests were 2-tailed where appropriate and all tests 
were carried out using Stata 12.1 (StataGorp, USA). 

Results and discussion 

Viral load 

The distribution of HPV16, HPV18, HPV31 and HPV45 
DNA VL by cytology grade is shown in Figure lA. The 
median HP VI 6 VL in normal cytology samples (0.04 c/c 
[IQR <0.01 - 0.71]; n = 34) was lower than that for low 
grade (LG) (6.59 c/c [0.50 - 22.72]; n = 67; 0.001; 
Mann Whitney U test) or high grade (HG) cytology 
samples (6.87 c/c [1.42 - 23.65]; n = 125; p < 0.001). The 
median HP VI 8 VL in normal cytology samples (0.03 c/c 
[<0.01 - 0.03]; n = 5) was also generally lower than the 
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LG (135 c/c [0.03 - 6.08]; n = l3;p = 0.084) and HG sam- 
ples (0.20 c/c [0.08 - 1.70]; n = 14; p = 0,033), This was 
also the case for HPV31 and HPV45. Thus, for all sam- 
ples, regardless of HPV genotype, the median VL of nor- 
mal cytology samples (0.03 c/c [<0.01 - 0.64]; n = 65) 
was lower than for the LG (4.01 c/c [0.51 - 22.72]; n = 127; 

0.001) and HG cytology samples (4.62 c/c [0.98 - 
20.00]; n = 170; j9< 0.001). 

We next wished to evaluate whether VL could differ- 
entiate between normal cytology and cervical disease 
with an appropriate degree of sensitivity and specificity. 
ROC plots for normal (control) versus LG and HG 
cytology (cases) samples are shown in Figure IB while 
the resulting sensitivity and specificity plots for detec- 
tion of disease are shown in Figure 2. HPV16 VL was 



able to distinguish between cases and controls with a 
sensitivity of 70% and a specificity of 91% (AUG 0.859) at 
a VL threshold of 1.58 c/c (OR 24.5 [95% CI 7.1 - 128.2]; 

0.001). HPV18 VL was able to distinguish between 
cases and controls with a sensitivity of 81% and a speci- 
ficity of 80% (AUG 0.800) at a VL threshold of 0.04 (OR 
17.6 [1.2 - 904.3]; = 0.015). HPV31 VL was able to 
distinguish between cases and controls with a sensitivity 
of 86% and a specificity of 78% (AUG 0.862) at a VL 
threshold of 0.22 (OR 20.3 [4.5 - 101.8]; < 0.001). 
HPV45 VL was able to distinguish between cases and 
controls with a sensitivity of 80% and specificity of 78% 
(AUG 0.778) at a VL threshold of 0.51 (OR 12.0 [1.6 - 
99.0]; p = 0.005). HPV VL overall was able to distinguish 
between cases and controls with a sensitivity of 75% and 
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a specificity of 80% (AUG 0.844) at a VL threshold of 
0.73 (OR 12.4 [6.2 - 26.1]; p < 0.001). 

These data lend support to the potential for VL to be 
used to differentiate between normal and cytologically 
abnormal samples for a range of HPV types. Most other 
studies either did not include samples displaying normal 
cytology or histology [7-9], or could not make such a 
determination when they did [6,11,13-17]. Where ROC 
or similar analyses have been applied, two studies found 
that HPV16 VL could differentiate between HSIL and 
LSIL with a sensitivity of 50% and a specificity of 90% 
[6,7]. Another study found that VL levels could differen- 
tiate between low grade CIN and CIN2+ with improved 
sensitivity compared with cytology alone, but with an 
unacceptable impact on specificity [8]. Yet another study 
was able to differentiate between cervical cancer samples 
and those exhibiting CIN2 or CIN3 disease using HP VI 6, 
HPV18 and HPV52 (but not HPV58) VL levels [9]. It 
is not clear why there are such conflicting reports on 
the potential for an association between HPV VL and 
cervical disease, but the use of single infection samples 
may have helped in this respect. 

Viral integration 

A comparison of the physical status of the HPV genome 
with cytological grade is shown in Figure 3. While a 
small proportion of samples harbored fully integrated 
HPV (n = 12 of 163 samples, 7.4%), the majority con- 
tained either a mixture of integrated and episomal forms 
or purely episomal forms. Of the fully integrated ge- 
nomes (n = 12), samples lacking only the Ct E2 fragment 
or lacking both the Nt and Ct E2 fragments were more 



common (n = 4 [33%] and n = 7 [58%], respectively) than 
was disruption of the Nt E2 fragment alone (n = 1; 8%). 
Where this has been examined, studies report that dis- 
ruption is commonly found in the Nt fragment of the E2 
gene and entire E2 deletion is rare [13,15,17,35]. 

ROC analyses for the Nt E2/E6 and Ct E2/E6 ratios for 
HPV16 showed poor discrimination (AUC 0.430 and 0.415, 
respectively). The proportion of LG or HG cytology sam- 
ples containing mixed or fully integrated forms was similar 
to those of the normal samples for HPV16 (OR 2.4 [95% CI 
0.7 - 9.1]; p = 0A54) and HPV31 (OR 2.6 [0.3 - 18.0]; 

= 0.348). For HPV18 and HPV45 there were too few 
normal samples to perform an appropriate analysis. For all 
samples, regardless of HPV type, there was a tendency for 
a slightly higher proportion of samples containing mixed 
or fully integrated forms in the cases (LG and HG samples) 
than in the controls (normal samples) as suggested by the 
OR of 2.6 (95% CI 1.0 - 6.8; p = 0.054) accompanied by a 
sensitivity of 72% but a poor specificity of 50% (Figure 2). 

We used an empirical threshold [7,11] since in recon- 
stitution experiments we could distinguish between sim- 
ulated 0% and 20% integration levels. However, we also 
examined the potential for an association between inte- 
gration status and disease using a range of thresholds 
(1.0, 0.8 and 0.5) for comparison with other studies 
[6,9,15-17,19]. Reducing the threshold had the effect of 
reducing the number of mixed forms leading to a de- 
crease in the sensitivity and increase in the specificity for 
detection of LG or HG disease, although not reaching 
significance at any threshold (Additional file 2: Figure SI). 

Although some studies could differentiate between 
cervical disease stages, usually between LG and HG 
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disease using integration status [6,7,9], many could not 
[11-13,15-17,19]. Despite using samples harboring only 
single HPV types, where potentially confounding effects 
of targeting a non-disease associated type in a mixed 
infection sample should have been reduced, these data 
suggest that HPV integration status, at least as mea- 
sured by E2/E6 PGR, may not be appropriately sensitive 
for utility as a diagnostic tool for cervical disease. 
Other assays such as APOT and/or DIPS may have better 
utility in this regard [18]. 

CpG methylatlon 

The methylation status of CpG sites within the 3'Ll- 
URR fragment of HPV16 (n = 79 samples) and HPV18 



(n = 18) and of sites within the 3'Ll fragment of HPV31 
(n = 50) and HPV45 (n = 18) are shown in Figure 4. 

There were multiple CpG sites within the 3'Ll-URR 
fragment of HP VI 6 that displayed low levels of methyla- 
tion across all grades of cytology. On an individual CpG 
site basis, there were no sites whose methylation levels 
were able to distinguish normal samples from LG and 
HG samples {p > 0.05). However, there were three sites 
able to separate HG samples from LG and normal sam- 
ples: one site in LI (bp 7089; p = 0.011) and two in the 
URR (7268, p = 0.021 and 7431a, p = 0.007) (Figure 4A). 
The average methylation of these three sites could 
distinguish normal and LG (controls) from HG (cases) 
cytology samples with a sensitivity of 67% and a 
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specificity of 86% (AUG 0.768) at a threshold of 5.3% 
(OR 12.8 [95%CI 3.7 - 49.6]; p < 0.001) (Figure 2). CpG 
site methylation in the 3'Ll-URR fragment of HPV18 
was similarly low in the normal and LG samples, but 
considerably higher in some of the HG samples, ty- 
pically those classified as being severely dyskaryotic 
(Figure 4B). HG cytology samples could be distin- 
guished from normal and LG samples with a sensitivity 
of 78% and a specificity of 89% (AUG 0.901) at an aver- 
age methylation threshold of 6.7% across the nine HG 
disease-associated sites (OR 28.0 [95%CI 1.5 - 1413.3; 
p = 0.015) (Figure 2). 

Of the sites in the short 3'Ll fragment of HPV31, 
methylation of the CpG site 6950 (Figure 4C) was weakly 
able to distinguish normal and LG samples from HG 
samples with a sensitivity of 50% and a specificity of 
77% (AUG 0.656) at a threshold of 20% (OR 3.3 [95% CI 
0.8 - 13.3]; p = Om) (Figure 2). For HPV45 there were 
nine sites associated with HG disease (Figure 4D) such 
that normal and LG samples could be distinguished 
from HG samples with a sensitivity of 75% and a specifi- 
city of 100% (AUG 0.839) at an average methylation 
threshold of 16.1% {p = 0.005) (Figure 2). 

Overall, normal and LG samples could be distin- 
guished from HG samples with a sensitivity of 64% and 
a specificity of 82% (AUG 0.746) at an average methyla- 
tion level of 5.3% (OR 8.2 [95% CI 3.8 - 18.0; p < 0.001) 
(Figure 2). However, given the wide disparity in breadth 
and magnitude of methylated CpG sites between HPV 
genotypes, it may not be helpful to make such a 
generalization and perhaps highlights the need to treat 
each genotype individually. 

These observations corroborate and extend previous 
reports on HPV16 [20,21,25] and HPV18 [25,29,30] 
demonstrating that while some CpG sites in HPV16 
may be weakly associated with increasing cervical disease 
severity, HPV18-associated high grade cervical disease is 
characterized by broad CpG site methylation of a high 
magnitude. Two recent studies suggest that the methyla- 
tion status of certain key HP VI 6 CpG sites may be able to 
differentiate between cervical cancer and low grade dis- 
ease [22,36] and may even have the utility to be predictive 
[22]. Data on the methylation profiles of HPV31 and 
HPV45 are limited [29,37]. One study demonstrated 
differences between CIN3 cases and asymptomatic 
controls across a range of sites [29], while the other 
reported the proportion, but not the magnitude, of 
methylated sites in the HPV genome and a cellular gene 
increased with increasing disease severity [37]. The 
present data appear to suggest that this highly methyl- 
ated state probably only occurs late in disease. In 
addition, these data corroborate the finding that there is 
an apparent species-specific association in HPV genome 
methylation [29], given the quite different methylation 



profiles of HPV types within the alpha-9 (HP VI 6, 
HPV31) and alpha-7 (HPV18, HPV45) species groups. 



Association between molecular parameters 

Samples harboring solely integrated HP VI 6 DNA had a 
lower VL (median 0.14 c/c [IQR 0.11 - 0.30]; n = 7) than 
samples with solely episomal (1.26 c/c [0.71 - 21.25]; 
n = 38;p = 0.007) or mixed (0.99 c/c [0.43 - 22.64]; n = 
46; p = 0.008) forms. For HPV31 samples containing mixed 
forms had a higher VL (4.40 c/c [0.76 - 7.55]; n = 31) than 
those with purely episomal forms (0.44 c/c [0.15 - 0.67]; 
n = 9; p = 0.001). For HPV18, solely integrated HPV18 
DNA had a lower median VL (0.04 c/c [0.04 - 0.06]; 
n = 3) than samples with solely episomal (0.23 [0.07 - 
0.61]; n = ^;p = 0.157) or mixed (2.87 [0.20 - 5.23]; n = 11; 
p = 0.073) forms. For HPV45 there were too few samples 
containing solely episomal or integrated forms to carry 
out this assessment. Overall, samples with solely inte- 
grated HPV DNA tended to have a lower VL (0.11 c/c 
[0.06 - 0.23]; n = 12) than samples with solely episomal 
(1.00 c/c [0.38 - 7.85]; n = 51;p = 0.003) or mixed (1.38 
c/c [0.53 - 8.23]; n = 100; p = 0.001) forms while sam- 
ples containing episomal or mixed forms had similar 
VL {p = 0.238). These data support the observation that 
samples harboring fully integrated HPV DNA tend to 
have lower VL than samples containing episomal or 
mixed forms [6,7,10,15] in contrast to other studies 
[16,17,38]. 

The VL in samples exhibiting one or more methylated 
CpG sites in the 3'Ll fragment of HP VI 6 (median VL 
27.81 c/c [IQR 12.75 - 64.55]; n = 60) was similar to 
those samples that had no methylated CpG sites (18.86 
c/c [5.80 - 45.09]; n = 19; = 0.180). This was also the 
case if the VL values were segregated according to 
whether disease-associated CpG sites (7089, 7268 and 
7431a) were methylated (VL 30.17 c/c [10.29 - 62.71]; 
n = 61) or not (21.52 c/c [11.58 - 47.08]; n = 61;p = 0.752). 
For HPV31 samples exhibiting one or more methylated 
CpG sites in the 3'Ll fragment tended to have a lower me- 
dian VL (1.12 c/c [0.54 - 3.78]; n = 19) than samples exhi- 
biting unmethylated CpG sites (8.83 c/c [4.50 - 33.01]; n = 
31; 0.001). The VL of samples exhibiting methylated 
(1.62 c/c [0.57 - 3.84]; n = 12) or unmethylated CpG sites 
(5.62 c/c [2.39 - 25.37]; n = 6;p = 0.134) in the 3'Ll region 
for HP VI 8 were similar, as were the VL of samples 
exhibiting methylated (9.31 c/c [1.26 - 30.71]; n = 9) or 
unmethylated (4.51 c/c [1.92 - 11.91]; n = 9;p = 0.627) 
sites in the 3'Ll fragment of HPV45. 

There was no difference between the average 3'Ll 
methylation levels for those samples containing episomal 
(7.27% [95% CI 5.05 - 10.45]; n = 16) and those with 
mixed (7.60% [5.6 - 10.28]; n = 33) forms (p = 0.691). 
The lack of association may be confounded by having 



Marongiu et al. BMC Cancer 2014, 14:384 
http://www.bionnedcentral.conn/1471-2407/14/384 



Page 8 of 10 




Figure 4 Methylation status distribution according to disease grade and genotype. Methylation profiles for (A) HPV16, (B) HPV18, (C) 
HPV31 and (D) HPV45 with each CpG site listed vertically across the top of each profile alongside their approximate genome position. The level 
of site-specific methylation is shaded according the scale bar from 0% - 100%. NT, a site did not exist in the given sample or failed to amplify. 
^ p < 0.05, p < 0.01, p < 0.001 represent the significance level following a test for differences between normal and LG samples and HG 
samples (Mann Whitney U test). 



too few fully integrated samples to include in the ana- Conclusions 

lysis and/or the apparently poor precision of the E2/E6 The purpose of this study was to evaluate any potential 
PGR to determine the integration status. relationship between HPV type-specific viral load, Integra- 
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tion and methylation status and current cervical disease 
stage with regard to a single infecting HPV genotype, 
wherein the association with disease can be assumed 
with some confidence. 

There are potential shortcomings to this study. First, 
samples containing a single infecting genotype were 
identified using a generic PGR and genotyping test [33]. 
While this is the most common approach used and the 
test used arguably one of the more robust [39], such a 
determination is not without its problems including the 
potential for masking within mixed infections [40]. The 
selection of single infection samples is, we believe, a 
significant improvement on the use of unselected sam- 
ples, wherein the potential impact of such masking is 
likely to be far higher. Nevertheless, the possibility that 
a minority of samples in this study contained low levels 
of one or more other HPV types cannot be ruled out. 
Second, although these samples constitute a highly selected 
panel and a minority sample type collected during routine 
screening, they do nevertheless permit the evaluation of the 
potential for these markers to be used, with a limited num- 
ber of other confounding factors. Finally, although cytology 
samples are the primary sample type collected during cer- 
vical screening and are readily amenable to such testing, 
the use of cytology samples alone to improve the definition 
of cervical disease stage is problematic given the often dis- 
continuous relationship between cytology and histology 
stage designations [1]. 

In summary, even under these optimized conditions, 
the sensitivities and specificities of many of these po- 
tential molecular markers were lower than the median 
sensitivity (98%) and specificity (86%) obtained in a 
recent evaluation of commercial HPV DNA tests for 
identifying CIN2+ cases during cytological screening 
[5]. There were a few individual measurements that 
have the potential for use in triage, characterized by a 
similar or higher specificity than required for initial 
screening. HPV VL appears to have an overall specificity 
of ca, 80% (range 76 - 91%) to differentiate normal 
cytology from abnormal cytology. Conversely, HPV 
methylation appears to be able to differentiate HG cy- 
tology from normal and LG cytology with a specificity 
of 77 - 100%. Both of these measures, therefore, have 
potential for use in screening and/or triage, but the util- 
ity of HPV genome methylation status may be improved 
if the disease-associated genotype-specific CpG site 
methylation patterns can be appropriately exploited. 

Additional files 



Additional file 1: Table SI. Primer and Probes. 

Additional file 2: Figure SI. Impact of Varying Integration Thresholds. 
Top panels depict the number and percentage of samples exhibiting 
episomal, mixed or fully integrated genomes arsing from the use of a range 



of E2/E6 thresholds. Bottom panels depict the sensitivity and specificity plots 
for integration status being able to differentiate between normal and 
abnormal cytology (LG and HG) at a range of E2/E6 thresholds, p values 
refer to the proportion of mixed or fully integrated samples compared to 
episomal samples when using all samples regardless of HPV type (Fisher's 
exact test). 
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